A Scalable Audio Fingerprint Method with Robustness to Pitch-Shifting
Authors
Abstract
Audio fingerprinting techniques should be robust to a variety of distortions caused by noisy transmission channels or specific sound processing. Although most current techniques are robust to the majority of these distortions, their quasi-systematic use of a spectral representation makes them potentially sensitive to pitch-shifting, since this distortion modifies the spectral content of the signal. In this paper, we propose a novel fingerprinting technique that couples a hashing technique with a fingerprint based on the constant-Q transform (CQT) and is strongly robust to pitch-shifting. Furthermore, we combine this method with an efficient post-processing step for the removal of false alarms. We also present the adaptation of a database pruning technique to our specific context. We evaluated our approach on a real-life broadcast monitoring scenario. The analyzed data consisted of 120 hours of real radio broadcast (thus containing all the distortions that would be found in an industrial context). The reference database consisted of 30,000 songs. Our method, thanks to its increased robustness to pitch-shifting, shows an excellent detection score.
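As a rough illustration of why a constant-Q representation is attractive for this problem, the sketch below compares how a set of spectral peaks moves under pitch-shifting on a linear frequency axis and on a geometrically spaced (constant-Q) axis. It is not the paper's implementation: the helper names `cqt_bin` and `pitch_shift`, the minimum frequency of 55 Hz, the resolution of 36 bins per octave, and the example peak frequencies are all assumptions chosen for the demonstration.

```python
import numpy as np


def cqt_bin(freq_hz, f_min=55.0, bins_per_octave=36):
    """Fractional bin index of a frequency on a geometrically spaced axis.

    f_min and bins_per_octave are illustrative assumptions, not values
    taken from the paper.
    """
    freq_hz = np.asarray(freq_hz, dtype=float)
    return bins_per_octave * np.log2(freq_hz / f_min)


def pitch_shift(freq_hz, semitones):
    """Scale every frequency by the ratio corresponding to `semitones`."""
    return np.asarray(freq_hz, dtype=float) * 2.0 ** (semitones / 12.0)


# Hypothetical spectral peaks of a reference signal, in hertz.
peaks_hz = np.array([110.0, 220.0, 330.0, 440.0, 1000.0])

# The same peaks after the signal is shifted up by one semitone.
shifted_hz = pitch_shift(peaks_hz, semitones=1.0)

# On a linear axis each peak moves by a different amount.
linear_offsets = shifted_hz - peaks_hz

# On the log-frequency axis every peak moves by the same number of bins.
log_offsets = cqt_bin(shifted_hz) - cqt_bin(peaks_hz)

print("offsets in Hz:  ", np.round(linear_offsets, 2))
print("offsets in bins:", np.round(log_offsets, 2))
```

Under these assumptions every peak moves by the same three bins on the log-frequency axis, while the offsets in hertz grow with frequency. A fingerprint built from the relative positions of peaks on such an axis is therefore unaffected by the shift, up to a global translation.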
Similar resources
SIFT-based local spectrogram image descriptor: a novel feature for robust music identification
Music identification via audio fingerprinting has been an active research field in recent years. In real-world environments, music queries are often deformed by various interferences, which typically include signal distortions and time-frequency misalignments caused by time stretching, pitch shifting, etc. Therefore, robustness plays a crucial role in music identification techniques. In this p...
Low-order auditory Zernike moment: a novel approach for robust music identification in the compressed domain
Audio identification via fingerprint has been an active research field for years. However, most previously reported methods work on the raw audio format in spite of the fact that nowadays compressed format audio, especially MP3 music, has grown into the dominant way to store music on personal computers and/or transmit it over the Internet. It will be interesting if a compressed unknown audio fr...
Panako - A Scalable Acoustic Fingerprinting System Handling Time-Scale and Pitch Modification
This paper presents a scalable granular acoustic fingerprinting system. An acoustic fingerprinting system uses condensed representations of audio signals, acoustic fingerprints, to identify short audio fragments in large audio databases. A robust fingerprinting system generates similar fingerprints for perceptually similar audio signals. The system presented here is designed to handle time-scale...
Quad-Based Audio Fingerprinting Robust to Time and Frequency Scaling
We propose a new audio fingerprinting method that adapts findings from the field of blind astrometry to define simple, efficiently representable characteristic feature combinations called quads. Based on these, an audio identification algorithm is described that is robust to large amounts of noise and speed, tempo and pitch-shifting distortions. In addition to reliably identifying audio queries...
Improving Audio Watermark Robustness Using Stretched Patterns against Geometric Distortion
One of the problems for audio watermarks is robustness against signal processing causing de-synchronization of the pseudo-random sequences. To tackle the problem, we previously introduced an audio watermarking method using a two-dimensional pseudo-random array, which is robust against pitch shifting and random stretching to some extent. In this paper, we explain a modification to the detection ...